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We derive the optimal A'^-photon two-mode input state for obtaining an estimate (j> of the phase 
difference between two arms of an interferometer. For an optimal measurement [B. C. Sanders and 
G. J. Milburn, Phys. Rev. Lett. 75, 2944 (1995)], it yields a variance (A(?!))^ ~ t? IN'\ compared 
to 0{N~^) or 0(A''~^") for states considered by previous authors. Such a measurement cannot 
be realized by counting photons in the interferometer outputs. However, we introduce an adaptive 
measurement scheme that can be thus realized, and show that it yields a variance in very close 
to that from an optimal measurement. 
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Interferometry is the basis of many high-precision mea- 
surements. The ultimate limit to the precision is due to 
quantum effects. This limit is most easily explored for a 
Mach-Zehnder interferometer (see Fig. 1^, where $ should 
be ignored for the moment). The outputs of this device 
can be measured to yield an estimate (f> of the phase dif- 
ference ip between the two arms of the interferometer. It 
is well known that this can achieve the standard quan- 
tum limit for phase sensitivity of (A0)^ ~ N~^ when an 
iV-photon number state enters one input port. Several 
authors ||, g, ||, |[ ||] have proposed ways of reducing the 
phase variance to the Heisenberg limit of ^N^"^ . Here N 
is the fixed total number of photons in the inputs Iq] . 

Most of these proposals [l|, ||, |[ are limited in that 
they require that the phase difference tp between the two 
arms be zero or very small in order to obtain the iV~^ 
scaling. Sanders and Milburn [ffl, M considered an ideal 
or canonical measurement, for which the N^"^ scaling is 
independent of ip. Unfortunately they do not explain 
how this measurement can be performed, and it can be 
shown W that it cannot be realized by counting photons 
in the outputs of the interferometer. In this Letter we 
show that there is an experimentally realizable measure- 
ment scheme using photodetectors and feedback which is 
almost as good as the canonical measurement. 

Before introducing our adaptive scheme, we find the 
optimal input states for the canonical interferometric 
measurements. These will then be used as the input 
states for our adaptive scheme, to demonstrate a (A0)^ 
scaling almost as good as iV~^. The optimal input 
states are interesting in themselves, in that they differ 
significantly from the input states considered in Refs. 
[0, g, g, §, |§1. In particular, our rigorous analysis shows 
that those non-optimal states, in fact, exhibit a worse 
scaling than the standard quantum limit of N^^ . 

The canonical measurement. — Using the same nota- 
tion as Sanders and Milburn [Q, we designate the two 
annihilation operators for the two input modes as a and 
h, and we use the Schwinger representation 

J^ = {a)h + a6t)/2, Jy = {a)h - aP)/2i, (1) 



Jz 



{a)a-Pb)/2, .P = Jl + Jl + Jl 



(2) 



We use the notation \j^i)z for the common eigenstate of 
Jz and J^ with eigenvalues n and i(j + 1), respectively. 
This state corresponds to Fock states with j + ji and j — jjl 
photons entering ports a and 6, respectively. 

From Ref. 0, the optimal probability operator mea- 
sure (POM) for phase measurements is the canonical one, 



E 



2i + l 

27r 



|J0>(J0|#, 



(3) 



where the phase states \j(t>) are defined in terms of the 
Jy eigenstatcs by \j<f>) = (2j + 1)-^/" E'^=-, e'^1jM>y 
In terms of the Jy eigenstates, the POM is 

3 



E{(J3)d(j) = 



1 

2^ 



E 



j(tj,~u)4> 



\jn)y{3u\d4,. (4) 
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FIG. 1: The Mach-Zehnder interferometer, with the addition 
of a controllable phase $ in one arm. The unknown phase to 
be estimated is <pi. Both beam splitters (BS) are 50:50. 



The POM defines the probabiHty distribution for (/), the 
best estimate for the interferometer phase tp, by 

P(0)d0=(V.|^(0)|7^)d0. (5) 

Here \'ip) is the two-mode interferometer input state hav- 
ing N = 2j photons. 

The optimal input state. — A short examination reveals 
that the canonical POM (m has the same matrix ele- 
ments as the POM for ideal measurements on a single 
mode with an upper limit of N on the photon number. 
The optimal state in this case has been considered before 
^ ^. Here we follow the procedure of Ref. Q, which 
minimizes the Holevo phase variance |I(| 



(A0)2 ^V^^S. 



1, 



(6) 



where S^ G [0, 1] is the sharpness of the phase distribu- 
tion, defined as 



Sa, = 



27r 



d(j)P{(J3)e''^'>'-^\ 



(7) 



where the "mean phase" (p is defined so that 5*0 is pos- 
itive. The Holevo variance is the natural quantifier for 
dispersion in a cyclic variable. If the variance is small, 
then it is easy to show that 



Kft 



4 sin 



PWd(b, 



(8) 



from which the equivalence to the usual definition of vari- 
ance for well- localized distributions is readily apparent. 
Using the Holevo variance enables a simple analytic 
solution. The minimum variance is 



tan 



N- 



(N 



0{N- 



(9) 



and the optimal state (chosen here to have a mean rela- 
tive phase of zero) is 



IV'o; 



1 



.pt/ 



VITT 



E 



in + j + l)7r 



2j 



\jl^)y (10) 



To obtain the state in terms of the eigenstates of Jz, 
we use y{jfi\ji^)z = e'(^/2)(''-^)/j?^(7r/2), where g 



iLi^m 



(J~m)!(j+m)! 

ij-iyy. ij + i^y. 



1/2 



p: 



{p—v.4J,+v) 



J-P 



for /i — j/>— 1, /i + i^>— 1, 



(0), 
(11) 



where P„ (a;) are the Jacobi polynomials, and the 
other matrix elements are obtained using the symmetry 
relations If,M = {-ir-^Ii,{9) - /i,._,,(e). 

An example of the optimal state for 40 photons is plot- 
ted in Fig. ^. This state contains contributions from all 
the Jz eigenstates, but the only significant contributions 
are from 9 or 10 states near /x = 0. The distribution 
near the center is fairly independent of photon number 
N = 2j. To demonstrate this, the distribution near the 
center for 1200 photons is also shown in Fig. 0. 




FIG. 2: The coefficients z{JIJ-\tpopt) for the state optimized 
for miniiTium phase variance under ideal measurements. All 
coefficients for a photon number of 2j — 40 are shown as the 
continuous line, and those near n — for a photon number of 
2j — 1200 as crosses. 

In order to compare this state with |jO)j;, where equal 
photon numbers are fed into both input ports (as consid- 
ered in Refs. [pi H, H]), the exact phase variance for this 
case was calculated for a range of photon numbers up to 
25 600. Since the phase must be measured modulo tt for 
this state, rather than using the Holevo phase variance, 
we used the following measure for the dispersion: 

(A0)2 =^20/4= (I (e^'*) 1-2-1)74, (12) 

where the expectation value is again determined using 
(P). The phase variances for this state and the optimal 
state are shown in Fig. |^. The exact Holevo phase vari- 
ance of the state where all the photons are incident on 
one port, \jj)z, is also shown for comparison. 
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FIG. 3: Variances in the phase estimate versus input photon 
number 2j. The lines are exact results for canonical measure- 
ments on optimal states \tpopt) (continuous line), on states 
with all photons incident on one input port \jj)z (dashed 
line), and on states with equal photon numbers incident on 
both input ports \jO)z (dotted line). The crosses are the nu- 
merical results for the adaptive phase measurement scheme on 
|7/;opt), and the plusses are those on a \jO)z input state. The 
circles are numerical results for a nonadaptive phase measure- 
ment scheme on |i/'opt). AH variances for the \jO}z state are 
for phase modulo n. 



As can be seen, the phase variance for \jO)z scales down 
with photon number much more slowly than the phase 
variance for optimal states |ll|, and even more slowly 
than the variance for \jj)z, which scales as N~^. In fact, 
this figure shows that the phase variance for \jQ)z scales 
as 7V~^/^, which agrees with what can be calculated from 
the asymptotic formula for P{(j)) given in Ref. J5|. This is 
a radically different result from the N~'^ scaling found in 
Refs. UR i- The state (|jO)^ + |jl)^)/V2, considered 
in Refr|g[J is even worse than the state |jO)^. 

The reason for this discrepancy is that the results 
found in Refs. [^, ^, g are all based on the width of 
the central peak in the distribution, but the main con- 
tribution to the variance is from the tails of the distribu- 
tion. This can be seen from the probability distribution 
multiplied by sin cj), since Eqs. (H) and (n2h imply that 
{A^f ~ J sin^(</) - ip)P{(j))d(j}. We plot this in Fig. | for 
A/' = 80 and ip — Q. In practice this means that the error 
in the phase will be small most of the time, but there will 
be a significant number of results with a large error. 

Adaptive measurements. — Although the quantum in- 
terferometry problem is now formally solved, this is of 
little practical use because (even if the optimal input 
states could be produced) it is not known how to imple- 
ment the canonical measurement scheme. In particular, 
it is impossible to implement it by counting photons in 
the two output ports of the interferometer pj , as an ex- 
perimenter would expect to do. Nevertheless, as we will 
show, it is possible to closely approximate the canonical 
measurement by counting output photons if one makes 
the measurement adaptive. The situation is as in Fig. |^. 
The unknown phase we wish to measure, (^, is in one arm 
of the interferometer, and we introduce a known phase 
shift, $, into the other arm of the interferometer. Af- 
ter each photodetection we adjust this introduced phase 
shift in order to minimize the expected uncertainty of our 
best phase estimate after the next photodetection. 

The annihilation operators ci and cq for the two output 
modes shown in Fig. |l| are related to the inputs by 

^ ^ in — (^ -]- U-n 7 ip — <^ ^ iiTT 



asm - 



6 cos 



(13) 
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FIG. 4: The canonical phase probability distribution for |jO)2 
multiplied by sin^ (f) for 2j — 80 photons. 



Before the mth photon has been detected, the phase <& 
will be fixed to the value <&„ by an adaptive algorithm 
to be specified later. Clearly the feedback loop which ad- 
justs $„j must act much faster than the average time be- 
tween photon arrivals. It is the ability to change $„ dur- 
ing the passage of a single (two-mode) pulse that makes 
photon counting measurements more general than a mea- 
surement of the output Jz considered in Refs. [g| ||. 
An adjustable second phase $ is of use even without 
By setting 



feedback |12 



$m = $n 



N 



(14) 



where $o is chosen randomly, an estimate <j) oi Lp can be 
made with an accuracy independent of (p. However, as 
we will show, this phase estimate has a variance scaling 
as O(iV-i) rather than 0{N-'^). 

Let us denote the result u from the mih detection as 
Ura (which is or 1 according to whether the photon is de- 
tected in mode cq or ci), and the measurement record up 
to and including the mth detection as the binary string 
n„i = Um ■ ■ ■ U2U1. The input state after m detections 
will be a function of the measurement record and ip, and 
we denote it as \'ip{nm, p)). It is determined by the initial 
condition \^j{nQ,ip)) = |^) and the recurrence relation 



|VJ(m^ 



L,(/?)) = C„„^ ((/?,$„) 



|V'(nm-i,(p)) 



(15) 



These states are unnormalized, and their norm repre- 
sents the probability for the record rim, given ip, 

F(n„i|(^) = {-ipinm, p)\ip{nm, ^))- (16) 

Thus the probability of obtaining the result u^ at the 
mth measurement, given the previous results rim-i, is 



F(m„|(/?, n„ 



-I) = 



{^p {Urnn„i-i, ip)\lp{Umn„i-l, p)) 

{'ip{nm~i, p)\i^{nra-i, p)) 



(17) 
Also, the posterior probability distribution for Lp is 

P(ip\n,n) = Afm{nrn){i^{nm, ^)\'ip{nm, ^)) , (18) 

where JVm{nm) is a normalization factor. To obtain this 
we have used Bayes' theorem assuming a flat prior dis- 
tribution for ip (that is, an initially unknown phase). A 
Bayesian approach to interferometry has been considered 
previously, and even realized experimentally |I3]. How- 
ever, this was done only with nonadaptive measurements 
and with all particles incident on one input port. 

With this background, we can now specify the adaptive 
algorithm for $„i. The sharpness of the distribution after 
the ?7T,th detection is given by 



^ipyUn 



l) = 



27r 



P{ip\umnm-i)e"^dip 



(19) 



We choose the feedback phase before the mth detection, 
$,„, to maximize the sharpness. Since we do not know 
u„i beforehand, we weight the sharpnesses for the two al- 
ternative results by their probability of occurring based 



on the previous measurement record. Therefore the ex- 
pression we maximize is 



Af ($„>„) = 



.m=0,l 



P{Um\nm~l)S^{Umnrn~l)- (20) 



Using Eqs. (p7|), (p^), and (p^, and ignoring the constant 
A/'m('T-m), the maximand can be rewritten as 



E 



27r 



{^p{umnm-i, (p)\ip{umn„i~i, (p))e'"^dip 



.... (21) 
The controUed phase $,„ appears imphcitly in ( |2l| ) 
through c„^ (((5, $,jj in (O), which appears in the re- 
currence relation dlq) . The maximizing solution <!>„ can 
be found analytically, but we will not exhibit it here. 

The final part of the adaptive scheme is choosing the 
phase estimate (j) oi Lp from the complete data set um- 
To maximize Re[(e**^'^~'''^)] (that is, to minimize the de- 
viation of (j) from ip) , we choose (j) to be the mean of the 
posterior distribution P{Lp\nN)^ which from Eq. (nSf) is 



arg 



2ir 



{i^{nM,^)\i^{nN.v)y^d^. (22) 



We can determine the approximate phase variance 
(A0)^ under this adaptive scheme using a stochastic 
method. The phase Lp is taken to be zero, which leads 
to no loss of generality since the initial controlled phase 
$1 is chosen randomly. The measurement results are de- 
termined randomly with probabilities determined using 
Lp — Q, and the final estimate (j) determined as above. 
From Eq. (|g), an ensemble {^/ilf^Li of M final esti- 
mates allows the phase variance to be approximated by 



(A0)2 



-i + |M"^E:Li 



£,»0H 



It is also possible to 



determine the phase variance exactly by systematically 
going through all the possible measurement records and 
averaging over <I>i. This method is feasible only for pho- 
ton numbers up to 20 or 30, however. 

The results of using this adaptive phase measurement 
scheme on the optimal input states determined above are 
shown in Fig. |[ The phase variance is very close to the 
phase variance for ideal measurements, with scaling very 
close to N^"^. The phase variances do differ relatively 
more from the ideal values for larger photon numbers, 
however, indicating a scaling slightly worse than iV~^. 
It is possible that the actual scaling is log N/N'^, as is 
the case for optimal single-mode adaptive phase measure- 
ments based on dyne detection [^ |l4) . For comparison, 
we also show the variance from the nonadaptive phase 
measurement defined by Eq. (|l4|). As is apparent, this 
has a variance scaling as N^^. Finally, the results for 
\jO)z input states with this adaptive phase measurement 
scheme (modified to estimate (p modulo n, as we must 
for these states) are also shown. The variances are again 
close to those for ideal measurements, scaling as N^^^^. 

To conclude, we have shown that although the \jO)z 
input states considered by previous authors do not give 
a phase variance that scales as 1/N'^ under ideal mea- 
surements, it is straightforward to derive optimal input 
states that do give this scaling. These states require sig- 
nificant contributions from only 9 or 10 j^ eigenstates 
(photon eigenstates for the two input modes). We have 
also demonstrated a practical measurement scheme (i.e., 
one based on photodetection of the output modes) to 
approximate ideal measurements. This scheme, which 
uses feedback, allows phase measurements with a vari- 
ance scaling close to 1/7V^ with optimized input states. 



[1] 

[2] 

[3] 
[4] 
[5] 
[6] 

[7] 



C. M. Caves, Phys. Rev. D 23, 1693 (1981). 
B. Yurke, S. L. McCall, and J. R. Klauder, Phys. Rev. A 
33, 4033 (1986). 

M. J. Holland and K. Burnett, Phys. Rev. Lett. 71, 1355 
(1993). 

B. C. Sanders and G. J. Milburn, Phys. Rev. Lett. 75, 
2944 (1995). 

B. C. Sanders, G. J. Milburn, and Z. Zhang, J. Mod. 
Opt. 44, 1309 (1997). 

Except for Ref. M, the above proposals have involved 
input states where the total number of photons is fixed. 
For simplicity, we follow this convention. 
We have found that for N = 3 and N = 4 our feedback 
scheme as applied to optimum input states is globally 
optimum, and also that the phase variance is above the 
canonical phase variance. This demonstrates that canoni- 
cal measurements cannot be realized by counting photons 



in the interferometer outputs, even if feedback is allowed. 
[8] G. S. Summy and D. T. Pegg, Opt. Commun. 77, 75 

(1990). 
[9] H. M. Wiseman and R. B. Killip, Phys. Rev. A 56, 944 

(1997). 
[10] A. S. Holevo, Springer Lecture Notes Math. 1055, 153 

(1984). 
[11] The phase variance starts out less than that for the op- 
timum state, but this is merely due to using a slightly 

different measure of the variance. 
[12] Z. Hradil et al, Phys. Rev. Lett. 76, 4295 (1996). 
[13] H. M. Wiseman and R. B. Killip, Phys. Rev. A 57, 2169 

(1998). 
[14] D. W. Berry and H. M. Wiseman, Phys. Rev. A 63, 

013813 (2001). 



